A Javaspace-Based Framework for Efficient Fault-Tolerant Master-Worker Distributed Applications
Identifieur interne : 002460 ( Main/Exploration ); précédent : 002459; suivant : 002461A Javaspace-Based Framework for Efficient Fault-Tolerant Master-Worker Distributed Applications
Auteurs : Virginie Galtier [France] ; Constantinos Makassikis [France] ; Stéphane Vialle [France]Source :
Abstract
We propose a framework built around a Java Space to ease the development of bag-of-tasks applications. The framework may optionally and automatically tolerate transient crash failures occurring on any of the distributed elements. It relies on check pointing and underlying middleware mechanisms to do so. To further improve check pointing efficiency, both in size and frequency, the programmer can introduce intermediate user-defined checkpoint data and code within the task processing program. The framework used without fault tolerance accelerates application development, does not introduce runtime overhead and yields to expected speedup. When enabling fault tolerance, our framework allows, despite failures, correct completion of applications with limited runtime and data storage overheads. Experiments run with up to 128 workers study the impact of some user-related and implementation-related on overall performance, and reveal good performances for classical Java Space-based master-worker application profiles.
Url:
DOI: 10.1109/PDP.2011.82
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Hal, to step Corpus: 000266
- to stream Hal, to step Curation: 000266
- to stream Hal, to step Checkpoint: 001F45
- to stream Main, to step Merge: 002510
- to stream Main, to step Curation: 002460
Le document en format XML
<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en">A Javaspace-Based Framework for Efficient Fault-Tolerant Master-Worker Distributed Applications</title>
<author><name sortKey="Galtier, Virginie" sort="Galtier, Virginie" uniqKey="Galtier V" first="Virginie" last="Galtier">Virginie Galtier</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-26305" status="VALID"><orgName>SUPELEC-Campus Metz</orgName>
<desc><address><addrLine>2 rue Edouard Belin 57070 Metz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.metz.supelec.fr/metz/</ref>
</desc>
<listRelation><relation active="#struct-300812" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-300812" type="direct"><org type="institution" xml:id="struct-300812" status="VALID"><orgName>SUPELEC</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author><name sortKey="Makassikis, Constantinos" sort="Makassikis, Constantinos" uniqKey="Makassikis C" first="Constantinos" last="Makassikis">Constantinos Makassikis</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-26305" status="VALID"><orgName>SUPELEC-Campus Metz</orgName>
<desc><address><addrLine>2 rue Edouard Belin 57070 Metz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.metz.supelec.fr/metz/</ref>
</desc>
<listRelation><relation active="#struct-300812" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-300812" type="direct"><org type="institution" xml:id="struct-300812" status="VALID"><orgName>SUPELEC</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author><name sortKey="Vialle, Stephane" sort="Vialle, Stephane" uniqKey="Vialle S" first="Stéphane" last="Vialle">Stéphane Vialle</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-26305" status="VALID"><orgName>SUPELEC-Campus Metz</orgName>
<desc><address><addrLine>2 rue Edouard Belin 57070 Metz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.metz.supelec.fr/metz/</ref>
</desc>
<listRelation><relation active="#struct-300812" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-300812" type="direct"><org type="institution" xml:id="struct-300812" status="VALID"><orgName>SUPELEC</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:hal-00618249</idno>
<idno type="halId">hal-00618249</idno>
<idno type="halUri">https://hal-supelec.archives-ouvertes.fr/hal-00618249</idno>
<idno type="url">https://hal-supelec.archives-ouvertes.fr/hal-00618249</idno>
<idno type="doi">10.1109/PDP.2011.82</idno>
<date when="2011-02-09">2011-02-09</date>
<idno type="wicri:Area/Hal/Corpus">000266</idno>
<idno type="wicri:Area/Hal/Curation">000266</idno>
<idno type="wicri:Area/Hal/Checkpoint">001F45</idno>
<idno type="wicri:explorRef" wicri:stream="Hal" wicri:step="Checkpoint">001F45</idno>
<idno type="wicri:Area/Main/Merge">002510</idno>
<idno type="wicri:Area/Main/Curation">002460</idno>
<idno type="wicri:Area/Main/Exploration">002460</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">A Javaspace-Based Framework for Efficient Fault-Tolerant Master-Worker Distributed Applications</title>
<author><name sortKey="Galtier, Virginie" sort="Galtier, Virginie" uniqKey="Galtier V" first="Virginie" last="Galtier">Virginie Galtier</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-26305" status="VALID"><orgName>SUPELEC-Campus Metz</orgName>
<desc><address><addrLine>2 rue Edouard Belin 57070 Metz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.metz.supelec.fr/metz/</ref>
</desc>
<listRelation><relation active="#struct-300812" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-300812" type="direct"><org type="institution" xml:id="struct-300812" status="VALID"><orgName>SUPELEC</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author><name sortKey="Makassikis, Constantinos" sort="Makassikis, Constantinos" uniqKey="Makassikis C" first="Constantinos" last="Makassikis">Constantinos Makassikis</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-26305" status="VALID"><orgName>SUPELEC-Campus Metz</orgName>
<desc><address><addrLine>2 rue Edouard Belin 57070 Metz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.metz.supelec.fr/metz/</ref>
</desc>
<listRelation><relation active="#struct-300812" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-300812" type="direct"><org type="institution" xml:id="struct-300812" status="VALID"><orgName>SUPELEC</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
<author><name sortKey="Vialle, Stephane" sort="Vialle, Stephane" uniqKey="Vialle S" first="Stéphane" last="Vialle">Stéphane Vialle</name>
<affiliation wicri:level="1"><hal:affiliation type="laboratory" xml:id="struct-26305" status="VALID"><orgName>SUPELEC-Campus Metz</orgName>
<desc><address><addrLine>2 rue Edouard Belin 57070 Metz</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.metz.supelec.fr/metz/</ref>
</desc>
<listRelation><relation active="#struct-300812" type="direct"></relation>
</listRelation>
<tutelles><tutelle active="#struct-300812" type="direct"><org type="institution" xml:id="struct-300812" status="VALID"><orgName>SUPELEC</orgName>
<desc><address><country key="FR"></country>
</address>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
</affiliation>
</author>
</analytic>
<idno type="DOI">10.1109/PDP.2011.82</idno>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass></textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">We propose a framework built around a Java Space to ease the development of bag-of-tasks applications. The framework may optionally and automatically tolerate transient crash failures occurring on any of the distributed elements. It relies on check pointing and underlying middleware mechanisms to do so. To further improve check pointing efficiency, both in size and frequency, the programmer can introduce intermediate user-defined checkpoint data and code within the task processing program. The framework used without fault tolerance accelerates application development, does not introduce runtime overhead and yields to expected speedup. When enabling fault tolerance, our framework allows, despite failures, correct completion of applications with limited runtime and data storage overheads. Experiments run with up to 128 workers study the impact of some user-related and implementation-related on overall performance, and reveal good performances for classical Java Space-based master-worker application profiles.</div>
</front>
</TEI>
<affiliations><list><country><li>France</li>
</country>
</list>
<tree><country name="France"><noRegion><name sortKey="Galtier, Virginie" sort="Galtier, Virginie" uniqKey="Galtier V" first="Virginie" last="Galtier">Virginie Galtier</name>
</noRegion>
<name sortKey="Makassikis, Constantinos" sort="Makassikis, Constantinos" uniqKey="Makassikis C" first="Constantinos" last="Makassikis">Constantinos Makassikis</name>
<name sortKey="Vialle, Stephane" sort="Vialle, Stephane" uniqKey="Vialle S" first="Stéphane" last="Vialle">Stéphane Vialle</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002460 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002460 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Wicri/Lorraine |area= InforLorV4 |flux= Main |étape= Exploration |type= RBID |clé= Hal:hal-00618249 |texte= A Javaspace-Based Framework for Efficient Fault-Tolerant Master-Worker Distributed Applications }}
This area was generated with Dilib version V0.6.33. |